The Impact of Activation Sparsity on Overfitting in Convolutional Neural Networks

نویسندگان

چکیده

Overfitting is one of the fundamental challenges when training convolutional neural networks and usually identified by a diverging test loss. The underlying dynamics how flow activations induce overfitting however poorly understood. In this study we introduce perplexity-based sparsity definition to derive visualise layer-wise activation measures. These novel explainable AI strategies reveal surprising relationship between overfitting, namely an increase in feature extraction layers shortly before loss starts rising. This tendency preserved across network architectures reguralisation so that our measures can be used as reliable indicator for while decoupling network’s generalisation capabilities from its loss-based definition. Moreover, differentiable formulation explicitly penalise emergence during impact reduced on studied real-time. Applying penalty analysing well known regularisers common supports hypothesis effectively improve classification performance. line with other recent work topic, methods insights into contradicting concepts capacity demonstrating dense enable discriminative learning efficiently exploiting deep models without suffering even trained excessively.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

the impact of e-readiness on ec success in public sector in iran the impact of e-readiness on ec success in public sector in iran

acknowledge the importance of e-commerce to their countries and to survival of their businesses and in creating and encouraging an atmosphere for the wide adoption and success of e-commerce in the long term. the investment for implementing e-commerce in the public sector is one of the areas which is focused in government‘s action plan for cross-disciplinary it development and e-readiness in go...

The Power of Sparsity in Convolutional Neural Networks

Deep convolutional networks are well-known for their high computational and memory demands. Given limited resources, how does one design a network that balances its size, training time, and prediction accuracy? A surprisingly effective approach to trade accuracy for size and speed is to simply reduce the number of channels in each convolutional layer by a fixed fraction and retrain the network....

متن کامل

the impact of musical texts on the text recall of young learners of english in isfahan junior high schools

abstract although music possesses some kind of power and using it has been welcome by many students in language classrooms, it seems that they take a non-serious image of the lesson while listening to songs and they may think that it is a matter of fun. the main objective of the present study was to investigate whether learning a foreign language through musical texts (songs) can have an impac...

15 صفحه اول

the impact of morphological awareness on the vocabulary development of the iranian efl students

this study investigated the impact of explicit instruction of morphemic analysis and synthesis on the vocabulary development of the students. the participants were 90 junior high school students divided into two experimental groups and one control group. morphological awareness techniques (analysis/synthesis) and conventional techniques were used to teach vocabulary in the experimental groups a...

15 صفحه اول

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Lecture Notes in Computer Science

سال: 2021

ISSN: ['1611-3349', '0302-9743']

DOI: https://doi.org/10.1007/978-3-030-68796-0_10